Comparative Analysis of Data Mining Tools and Classification Techniques using WEKA in Medical Bioinformatics
نویسنده
چکیده
The availability of huge amounts of data resulted in great need of data mining technique in order to generate useful knowledge. In the present study we provide detailed information about data mining techniques with more focus on classification techniques as one important supervised learning technique. We also discuss WEKA software as a tool of choice to perform classification analysis for different kinds of available data. A detailed methodology is provided to facilitate utilizing the software by a wide range of users. The main features of WEKA are 49 data preprocessing tools, 76 classification/regression algorithms, 8 clustering algorithms, 3 algorithms for finding association rules, 15 attribute/subset evaluators plus 10 search algorithms for feature selection. WEKA extracts useful information from data and enables a suitable algorithm for generating an accurate predictive model from it to be identified. Moreover, medical bioinformatics analyses have been performed to illustrate the usage of WEKA in the diagnosis of Leukemia.
منابع مشابه
Comparison of Various Classification Techniques Using Different Data Mining Tools for Diabetes Diagnosis
In the absence of medical diagnosis evidences, it is difficult for the experts to opine about the grade of disease with affirmation. Generally many tests are done that involve clustering or classification of large scale data. However many tests could complicate the main diagnosis process and lead to the difficulty in obtaining the end results, particularly in the case where many tests are perfo...
متن کاملPerformance Analysis of Different Classification Methods in Data Mining for Diabetes Dataset Using WEKA Tool
Data mining is the process of analyzing data based on different perspectives and summarizing it into useful information. Classification is one of the generally used techniques in medical data mining. The goal here is to discover new patterns to provide meaningful and useful information for the users. Recently data mining techniques are applied to healthcare datasets to explore suitable methods ...
متن کاملGene Expression Data Analysis Using Data Mining Algorithms for Colon Cancer
The concept of Data mining is used in various medical applications like tumor classification, protein structure prediction, gene classification, cancer classification based on microarray data, clustering of gene expression data, statistical model of protein-protein interaction etc. Adverse drug events in prediction of medical test effectiveness can be done based on genomics and proteomics throu...
متن کاملPerformance Analysis of Engineering Students for Recruitment Using Classification Data Mining Techniques
-Data Mining is a powerful tool for academic intervention. Mining in education environment is called Educational Data Mining. Educational Data Mining is concerned with developing new methods to discover knowledge from educational database and can used for decision making in educational system. In our work, we collected the student’s data from engineering institute that have different informatio...
متن کاملWEKA Approach for Comparative Study of Classification Algorithm
This paper discusses data mining techniques to process a dataset and identify the relevance of classification test data. Mining tools to solve large amounts of problems such as classification, clustering, association rule, neural networks, it is a open access tools directly communicates with each tool or called from java code to implement using this. In this paper we present machine learning da...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013